A Class-Based Agreement Model for Generating Accurately Inflected Translations
نویسندگان
چکیده
When automatically translating from a weakly inflected source language like English to a target language with richer grammatical features such as gender and dual number, the output commonly contains morpho-syntactic agreement errors. To address this issue, we present a target-side, class-based agreement model. Agreement is promoted by scoring a sequence of fine-grained morpho-syntactic classes that are predicted during decoding for each translation hypothesis. For English-to-Arabic translation, our model yields a +1.04 BLEU average improvement over a state-of-the-art baseline. The model does not require bitext or phrase table annotations and can be easily implemented as a feature in many phrase-based decoders.
منابع مشابه
Enhancing Morphological Alignment for Translating Highly Inflected Languages
We propose an unsupervised approach utilizing only raw corpora to enhance morphological alignment involving highly inflected languages. Our method focuses on closed-class morphemes, modeling their influence on nearby words. Our languageindependent model recovers important links missing in the IBM Model 4 alignment and demonstrates improved end-toend translations for English-Finnish and English-...
متن کاملAgreement Matters: Challenges of Translating into a Morphologically Rich Language, and the Advantages of a Syntax-Based System
Consider the following (simple) English sentences: “I drive a car.”, “I don’t know how to drive”, “I wash the car”, “I wash the floor”. Translating them to Hebrew using Google’s statistical MT system, yields: zipekna bdep ip` (I drive(masculine) a car); bedpl zr ei `l ip` (I don’t know(feminine) how to drive); ugex ip` zipeknd z` (I wash(masculine) the car); and dtvxd z` zthey ip` (I wash(femin...
متن کاملRecurrence Relations for Moment Generating Functions of Generalized Order Statistics Based on Doubly Truncated Class of Distributions
In this paper, we derived recurrence relations for joint moment generating functions of nonadjacent generalized order statistics (GOS) of random samples drawn from doubly truncated class of continuous distributions. Recurrence relations for joint moments of nonadjacent GOS (ordinary order statistics (OOS) and k-upper records (k-RVs) as special cases) are obtained. Single and product moment gene...
متن کاملA Mthod for Generating the Turbulent Intermittency Function
A detection method based on sensitization of a squared double differentiated signal is developed which discriminates the turbulent zones from laminar zones quite accurately. The procedure adopts a variable threshold and a variable hold time of the order of the Kolmogorov time scale. The output file so generated, includes all the information for further analysis of the turbulent signal.
متن کاملApplying Catford’s Category Shifts to the Persian Translations of Three English Romantic Poems
This research aimed at evaluating the types and frequency of category shifts in the Persian translations of English poems based on Catford’s model of shifts. To this end, three English romantic poems of A Histo- ry of English Literature, namely, Blake’s ‘The Chimney Sweeper’, Coleridge’s ‘Kubla Khan’, and Keats’ ‘To Autumn’ along with their Persian t...
متن کامل